Attention-guided chained context aggregation for semantic segmentation
نویسندگان
چکیده
The way features propagate in Fully Convolutional Networks is of momentous importance to capture multi-scale contexts for obtaining precise segmentation masks. This paper proposes a novel series-parallel hybrid paradigm called the Chained Context Aggregation Module (CAM) enrich feature representation. CAM gains various spatial scales through chain-connected ladder-style information flows and fuses them two-stage process, namely pre-fusion re-fusion. serial flow continuously increases receptive fields output neurons those parallel encode different region-based contexts. Each shallow encoder-decoder with appropriate down-sampling sufficiently contextual information. We further adopt an attention model guide Based on these developments, we construct Network (CANet), which employs asymmetric decoder recover details prediction maps. conduct extensive experiments six challenging datasets, including Pascal VOC 2012, Context, Cityscapes, CamVid, SUN-RGBD GATECH. Results evidence that CANet achieves state-of-the-art or competitive performance.
منابع مشابه
Context Encoding for Semantic Segmentation
Recent work has made significant progress in improving spatial resolution for pixelwise labeling with Fully Convolutional Network (FCN) framework by employing Dilated/Atrous convolution, utilizing multi-scale features and refining boundaries. In this paper, we explore the impact of global contextual information in semantic segmentation by introducing the Context Encoding Module, which captures ...
متن کاملObject Boundary Guided Semantic Segmentation
Semantic segmentation has been a major topic in computer vision, and has played an important role in understanding object classes as well as object localizations. Recent development in deep learning, especially in fully-convolutional neural network, has enabled pixel-level labeling for more accurate results. However most of the previous works, including FCN, did not take object boundary into co...
متن کاملSemantic Segmentation with Reverse Attention
Recent development in fully convolutional neural network enables efficient end-to-end learning of semantic segmentation. Traditionally, the convolutional classifiers are taught to learn the representative semantic features of labeled semantic objects. In this work, we propose a reverse attention network (RAN) architecture that trains the network to capture the opposite concept (i.e., what are n...
متن کاملContext Tricks for Cheap Semantic Segmentation
Accurate semantic labeling of image pixels is difficult because intra-class variability is often greater than inter-class variability. In turn, fast semantic segmentation is hard because accurate models are usually too complicated to also run quickly at test-time. Our experience with building and running semantic segmentation systems has also shown a reasonably obvious bottleneck on model compl...
متن کاملMixed context networks for semantic segmentation
Semantic segmentation is challenging as it requires both object-level information and pixel-level accuracy. Recently, FCN-based systems gained great improvement in this area. Unlike classification networks, combining features of different layers plays an important role in these dense prediction models, as these features contains information of different levels. A number of models have been prop...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Image and Vision Computing
سال: 2021
ISSN: ['0262-8856', '1872-8138']
DOI: https://doi.org/10.1016/j.imavis.2021.104309